A Deterministic Seeding Approach for k-means Clustering
نویسندگان
چکیده
منابع مشابه
K-Medoids For K-Means Seeding
We run experiments showing that algorithm clarans (Ng et al., 2005) finds better Kmedoids solutions than the standard algorithm. This finding, along with the similarity between the standard K-medoids and K-means algorithms, suggests that clarans may be an effective K-means initializer. We show that this is the case, with clarans outperforming other popular seeding algorithms on 23/23 datasets w...
متن کاملDeterministic Initialization of the K-Means Algorithm Using Hierarchical Clustering
K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately, due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of the cluster centers. Numerous initialization methods have been proposed to address this problem. Many of these methods, however, have superlinear complexity in the number of data points, making them imprac...
متن کاملA hybridized K-means clustering approach for high dimensional dataset
Due to incredible growth of high dimensional dataset, conventional data base querying methods are inadequate to extract useful information, so researchers nowadays is forced to develop new techniques to meet the raised requirements. Such large expression data gives rise to a number of new computational challenges not only due to the increase in number of data objects but also due to the increas...
متن کاملPersistent K-Means: Stable Data Clustering Algorithm Based on K-Means Algorithm
Identifying clusters or clustering is an important aspect of data analysis. It is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. It is a main task of exploratory data mining, and a common technique for statistical data analysis This paper proposed an improved version of K-Means algorithm, namely Persistent K...
متن کاملBalanced K-Means for Clustering
We present a k-means-based clustering algorithm, which optimizes mean square error, for given cluster sizes. A straightforward application is balanced clustering, where the sizes of each cluster are equal. In k-means assignment phase, the algorithm solves the assignment problem by Hungarian algorithm. This is a novel approach, and makes the assignment phase time complexity O(n), which is faster...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Scientific Research in Computer Science, Engineering and Information Technology
سال: 2021
ISSN: 2456-3307
DOI: 10.32628/cseit217246